Fuzzy Imputation Method for Database Systems
Abstract
Missing data and nonresponse are a common difficulty, of particular concern in medical and social science databases. Dealing with nonresponse can be difficult, and it is important to apply adequate missing-data methods to obtain valid inference. Missing data is a very common problem in real data sets, and different methods to solve it have been developed. A simple and common strategy is to ignore missing values, which reduces the size of the usable data set. Experience with databases has demonstrated the dangers of simply removing cases (listwise deletion) from the original data set, and deletion can introduce ...
Similar resources
Towards Missing Data Imputation: A Study of Fuzzy K-means Clustering Method
In this paper, we present a missing data imputation method based on one of the most popular techniques in Knowledge Discovery in Databases (KDD), i.e. clustering technique. We combine the clustering method with soft computing, which tends to be more tolerant of imprecision and uncertainty, and apply a fuzzy clustering algorithm to deal with incomplete data. Our experiments show that the fuzzy i...
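The idea of imputing from fuzzy clusters can be illustrated with a minimal sketch. It is not the paper's implementation: the function name `fuzzy_kmeans_impute`, its parameters, the column-mean starting values, and the rule of replacing each missing entry with a membership-weighted average of the cluster centroids are all assumptions made for illustration, and the sketch handles numerical data only.

```python
import numpy as np

def fuzzy_kmeans_impute(X, n_clusters=2, m=2.0, n_iter=50, seed=0):
    """Fill NaN entries of X using fuzzy cluster memberships (illustrative only)."""
    X = np.asarray(X, dtype=float)
    observed = ~np.isnan(X)
    rng = np.random.default_rng(seed)

    # Assumption: start from column means so every entry has a provisional value.
    col_means = np.nanmean(X, axis=0)
    filled = np.where(observed, X, col_means)

    # Initialise centroids from randomly chosen rows.
    centroids = filled[rng.choice(len(filled), n_clusters, replace=False)]

    for _ in range(n_iter):
        # Squared Euclidean distance from every row to every centroid.
        d2 = ((filled[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        d2 = np.maximum(d2, 1e-12)  # avoid division by zero

        # Fuzzy memberships: u_ik proportional to d2_ik^(-1/(m-1)), rows sum to 1.
        inv = d2 ** (-1.0 / (m - 1.0))
        u = inv / inv.sum(axis=1, keepdims=True)

        # Centroid update weighted by u^m.
        w = u ** m
        centroids = (w.T @ filled) / w.sum(axis=0)[:, None]

        # Re-estimate missing entries as membership-weighted centroid values;
        # observed entries are never overwritten.
        filled = np.where(observed, X, u @ centroids)

    return filled
```

Because each row belongs to every cluster to some degree, an imputed value blends several centroids instead of copying the nearest one, which is the tolerance of imprecision the abstract refers to.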
Microsoft Word - ICAME09_opti_leslabay_final
There are many situations where input feature vectors are incomplete and methods to tackle the problem have been studied for a long time. A commonly used procedure is to replace each missing value with an imputation. This paper presents a method to perform categorical missing data imputation from numerical and categorical variables. The imputations are based on Simpson’s fuzzy min-max neural ne...
On a Fuzzy c-means Algorithm for Mixed Incomplete Data Using Partial Distance and Imputation
The fuzzy c-means clustering method is normally applied to numerical data. However, most data in databases are both categorical and numerical. To date, clustering methods have been developed to analyze only complete data. Although we sometimes encounter data sets that contain one or more missing feature values (incomplete data), traditional clustering methods cannot be used for s...
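The partial distance named in the title can be sketched for the numerical case. In one common formulation, which this paper may define differently, the squared distance is computed over the observed features only and rescaled by the ratio of total to observed features. The function name `partial_distance_sq` is hypothetical, and categorical features are not handled here.

```python
import numpy as np

def partial_distance_sq(x, v):
    """Squared partial distance between a record x (NaN = missing) and a centroid v."""
    x = np.asarray(x, dtype=float)
    v = np.asarray(v, dtype=float)
    observed = ~np.isnan(x)
    n_obs = int(observed.sum())
    if n_obs == 0:
        raise ValueError("record has no observed features")
    # Sum squared differences over observed features, then rescale so that
    # records with different numbers of missing values stay comparable.
    return (x.size / n_obs) * np.sum((x[observed] - v[observed]) ** 2)

# Second feature is missing: (3 / 2) * ((1 - 0)**2 + (3 - 1)**2) = 7.5
print(partial_distance_sq([1.0, np.nan, 3.0], [0.0, 5.0, 1.0]))
```

Using this distance in place of the ordinary Euclidean distance lets a clustering routine accept incomplete records without discarding them first.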